Occluded Video Instance Segmentation: A Benchmark

نویسندگان

چکیده

Abstract Can our video understanding systems perceive objects when a heavy occlusion exists in scene? To answer this question, we collect large-scale dataset called OVIS for occluded instance segmentation, that is, to simultaneously detect, segment, and track instances scenes. consists of 296k high-quality masks from 25 semantic categories, where object occlusions usually occur. While human vision can understand those by contextual reasoning association, experiments suggest current cannot. On the dataset, highest AP achieved state-of-the-art algorithms is only 16.3, which reveals are still at nascent stage objects, instances, videos real-world scenario. We also present simple plug-and-play module performs temporal feature calibration complement missing cues caused occlusion. Built upon MaskTrack R-CNN SipMask, obtain remarkable improvement on dataset. The project code available http://songbai.site/ovis .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MaskRNN: Instance Level Video Object Segmentation

Instance level video object segmentation is an important technique for video editing and compression. To capture the temporal coherence, in this paper, we develop MaskRNN, a recurrent neural net approach which fuses in each frame the output of two deep nets for each object instance — a binary segmentation net providing a mask and a localization net providing a bounding box. Due to the recurrent...

متن کامل

Instance Embedding Transfer to Unsupervised Video Object Segmentation

We propose a method for unsupervised video object segmentation by transferring the knowledge encapsulated in image-based instance embedding networks. The instance embedding network produces an embedding vector for each pixel that enables identifying all pixels belonging to the same object. Though trained on static images, the instance embeddings are stable over consecutive video frames, which a...

متن کامل

Towards a Benchmark for Instance Matching

In the general field of knowledge interoperability and ontology matching, instance matching is a crucial task for several applications, from identity recognition to data integration. The aim of instance matching is to detect instances referred to the same real-world object despite the differences among their descriptions. Algorithms and techniques for instance matching have been proposed in lit...

متن کامل

Recognising Partially Occluded Faces From a Video

Extensive research on video based face recognition has been carried out by researchers in the recent years as security is a major concern in today’s world. Performing a comparison of face recognition with a still face and upon a video, video is given more importance as it can give more information to the user compared with still face image. Even though video can provide widespread information o...

متن کامل

Shape-aware Instance Segmentation

We address the problem of instance-level semantic segmentation, which aims at jointly detecting, segmenting and classifying every individual object in an image. In this context, existing methods typically propose candidate objects, usually as bounding boxes, and directly predict a binary mask within each such proposal. As a consequence, they cannot recover from errors in the object candidate ge...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computer Vision

سال: 2022

ISSN: ['0920-5691', '1573-1405']

DOI: https://doi.org/10.1007/s11263-022-01629-1